Logarithmic Time Online Multiclass prediction
نویسندگان
چکیده
We study the problem of multiclass classification with an extremely large numberof classes (k), with the goal of obtaining train and test time complexity logarith-mic in the number of classes. We develop top-down tree construction approachesfor constructing logarithmic depth trees. On the theoretical front, we formulate anew objective function, which is optimized at each node of the tree and createsdynamic partitions of the data which are both pure (in terms of class labels) andbalanced. We demonstrate that under favorable conditions, we can construct loga-rithmic depth trees that have leaves with low label entropy. However, the objectivefunction at the nodes is challenging to optimize computationally. We address theempirical problem with a new online decision tree construction procedure. Exper-iments demonstrate that this online algorithm quickly achieves improvement intest error compared to more common logarithmic training time approaches, whichmakes it a plausible method in computationally constrained large-k applications.
منابع مشابه
Logarithmic Time One-Against-Some
We create a new online reduction of multiclass classification to binary classification for which training and prediction time scale logarithmically with the number of classes. We show that several simple techniques give rise to an algorithm that can compete with one-against-all in both space and predictive power while offering exponential improvements in speed when the number of classes is large.
متن کاملLog-time and Log-space Extreme Classification
We present LTLS, a technique for multiclass and multilabel prediction that can perform training and inference in logarithmic time and space. LTLS embeds large classification problems into simple structured prediction problems and relies on efficient dynamic programming algorithms for inference. We train LTLS with stochastic gradient descent on a number of multiclass and multilabel datasets and ...
متن کاملThe price of bandit information in multiclass online classification
We consider two scenarios of multiclass online learning of a hypothesis class H ⊆ Y X . In the full information scenario, the learner is exposed to instances together with their labels. In the bandit scenario, the true label is not exposed, but rather an indication whether the learner’s prediction is correct or not. We show that the ratio between the error rates in the two scenarios is at most ...
متن کاملEfficient Online Multiclass Prediction on Graphs via Surrogate Losses
We develop computationally efficient algorithms for online multi-class prediction. Our construction is based on carefully-chosen data-dependent surrogate loss functions, and the new methods enjoy strong mistake bound guarantees. To illustrate the technique, we study the combinatorial problem of node classification and develop a prediction strategy that is linear-time per round. In contrast, the...
متن کاملEfficient Online Bandit Multiclass Learning with Õ(√T) Regret
We present an efficient second-order algorithm with Õ( 1 η √ T )1 regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by η, for a range of η restricted by the norm of the competitor. The family of loss functions ranges from hinge loss (η = 0) to squared hinge loss (η = 1). This provides a solution to the...
متن کامل